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Abstract 

In this worl|^ we discuss the joint preceding with finite rate feedback in the so-called network 
MIMO where the TXs share the knowledge of the data symbols to be transmitted. We introduce a 
distributed channel state information (DCSI) model where each TX has its own local estimate of the 
overall multi-user MIMO channel and must make a preceding decision solely based on the available local 
CSI. We refer to this channel as the DCSI-MIMO channel and the preceding problem as distributed 
preceding . We extend to the DCSI setting the work from Jindal in [1] for the conventional MIMO 
Broadcast Channel (BC) in which the number of Degrees of Freedom (DoFs) achieved by Zero Forcing 
(ZF) was derived as a function of the scaling in the logarithm of the Signal-to-Noise Ratio (SNR) of 
the number of quantizing bits. Particularly, we show the seemingly pessimistic result that the number of 
DoFs at each user is limited by the worst CSI across all users and across all TXs. This is in contrast to 
the conventional MIMO BC where the number of DoFs at one user is solely dependent on the quality 
of the estimation of his own feedback. Consequently, we provide preceding schemes improving on the 
achieved number of DoFs. For the two-user case, the derived novel precoder achieves a number of DoFs 
limited by the best CSI accuracy across the TXs instead of the worst with conventional ZF. We also 
advocate the use of hierarchical quantization of the CSI, for which we show that considerable gains are 
possible. Finally, we use the previous analysis to derive the DoFs optimal allocation of the feedback 
bits to the various TXs under a constraint on the size of the aggregate feedback in the network, in the 
case where conventional ZF is used. 



'This work has been performed in the framework of the European research project ARTIST4G, which is partly funded by 
the European Union under its FP7 ICT Objective 1.1 - The Network of the Future. 
Preliminary results have been published in ISIT 2011, St. Petersburg. 
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I. Introduction 

Network MIMO channel, or multicell MIMO channels, whereby multiple interfering transmit- 
ters (TXs) share user messages and allow for joint precoding (downlink), are currently considered 
for next generation wireless networks [|2|-[|4|. With perfect message and channel state information 
(CSI) sharing, the different TXs can be seen as a unique virtual multiple- antenna array serving 
aU receivers (RXs), in a multiple- antenna broadcast channel (BC) fashion. 

Although the sharing of user data symbols can be made possible in certain situations, such as 
cellular networks with a pre-existing backbone infrastructure where user packets can be routed to 
several base stations simultaneously, the obtaining of accurate CSI at the TXs is made difficult 
due to the finite quantizing effects over the feedback channels and the limited capability of 
signaling between TXs to exchange the CSI. In addition, CSI exchange necessarily introduces 
further degradation due to latency effects over inter- TX links [|5|. 

This situation gives rise to an interesting information theoretic framework whereby a MEMO 
broadcast channel is formed (due to the assumed perfect user message sharing among the various 
TXs), yet the individual TXs composing the distributed multiple- antenna array have access to 
individual CSI estimates, possibly different from each other, and possibly of different quality 
(statistically). In this paper, we refer to this channel as the distributed CSI (DCSI)-MIMO channel. 
We emphasize the difference between this CSI model and the previously studied CSI models 
such as the so-called imperfect limited CSI [[T| or the delayed CSI model [[6| where the TX 
antennas are assumed to share ideally the same imperfect channel knowledge. 

Note that the sharing of the symbols via finite capacity links between the cooperating TXs has 
been discussed in recent works [|7|-[10|. This problem represents in itself a challenging topic, 
and we consider in the sequel perfect sharing of the users symbols. 

For the conventional MIMO BC, the impact of limited feedback [[T|, [[TT|-[[T6| and the 
derivation of robust solutions pP7| , [18| have been investigated, with later extensions to the 



multicell coordinated beamforming case [19| and the multicell MIMO case [20|-[22|. 

More recently, the optimization of the feedback allocations to the different users has been 



the focus of a large interest. It has been studied in conventional MIMO BCs [23 1, in multicell 



settings with coordinated beamforming [24|-[27|, in multicell MIMO networks [28|-|30| and 



in interfering BCs ||3TJ, ||32|. 

Yet, as mentioned before, these papers always consider perfect sharing between the TXs 
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preceding jointly the signal. In contrast, we consider here that each TX has its own imperfect 
estimation of the multi-user channel but all the TXs jointly precode the user's data symbols. 
This gives rise to a very different transmission setting which can be seen as a team decision 
problem [33|. Indeed, the precoder must cope not only with the inaccuracy of the CSI due to the 
hmited feedback channel capacity but also with the distributiveness of the CSI and the preceding. 
Each TX emits one component of the transmit signal vector which it computes based on its own 
channel estimate. As is pointed out in this work, the discrepancies between the channel estimates 
obtained by the different TXs are particularly detrimental to the channel capacity, and even to 
the Degrees of Freedom (DoFs), if not accounted for in the preceding design. 

The DCSI-MIMO scenario has meaningful applications to network MIMO schemes in cellular 



networks or MEMO based multi-TX cooperation in general. It was first studied in [34|, and a 
tractable discrete optimization at finite SNR was derived. However, the approach in p4| does not 
lend itself to a more general performance analysis, thus giving limited insight for an improved 
design. 

In this paper, we consider the performance of preceding schemes ever the DCSI-MIMO 
channel from a DoFs perspective. The number of DoFs represents the slope with which the rate 
increases with the SNR in the high SNR regime. Even though it is based on the high SNR 
analysis, it has been used widely used to gain insight into the wireless transmission thanks to 
its analytical tractability Q, p5( |. By essence, the DoFs analysis is not impacted by the unequal 
pathloss, which can put in question its practical signification in some settings. When all the 
wireless links present the same pathloss as it is the case in this work, this does not represent 
an issue. To extend the DoFs analysis to settings with large pathloss differences, it is then more 



adequate to use the notion of generalized DoFs [36| which takes the pathloss differences into 
account. We also assume that our system model is isolated from the rest of the world. In a 
practical scenario, it follows from the impossibility to serve jointly all the users that there is 
inevitably interference coming from outside the cooperation area. This implies that the number 



of DoFs is always zero [37|. The number of DoFs derived inside the cooperation cluster is then 
representative solely up to an SNR at which point the interference from outside the cooperation 
area leads to the saturation of the rate. 

Our work generalizes to the case of distributed CSI the finite rate feedback study by Jindal [[T| 
for the conventional multiple-antenna BC. In [jlj, the author derives the number of DoFs as a 
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function of the number of feedback (quantizing) bits exploited by each RX and shows that the 
number of bits must grow with the logarithm of the SNR in order to preserve the full number 
of DoFs, using ZF precoding arguments. We also consider ZF schemes as they are known to 
achieve maximum number of DoFs in wide setting^ Particularly, a necessary and sufficient 
feedback of the CSI estimation error for achieving the maximum number of DoFs is derived 
in pT| for the compound multiple-antenna BC. This condition is the same as the sufficient 
condition provided in [[T|. Thus, no other precoding scheme can achieve the maximal number of 
DoFs with a lower feedback scaling. This confirms the efficiency of ZF in terms of number of 
DoFs. As a consequence, we aim in this paper at answering the fundamental questions "Does 
conventional ZF also perform well in the distributed MIMO setting?", and "How can we make 
it more robust in that setting ? " 

Specifically, the main contributions read as follows. Let the number of bits quantizing the 
estimate at TX j of the normalized channel hf of user i be a^p{K-l) \og^{P) with af^ e [0, 1] 
and K the number of users. Then, we show that in a block fading Rayleigh channel: 

• The number of DoFs achieved at RX i with conventional ZF is equal to minj j^} ap-*. 
Hence, the worst accuracy across all the estimates limits the number of DoFs at each user. 
This is a pessimistic result and shows a different behavior compared to the conventional 
MIMO BC. 

• We provide a precoding scheme improving the number of DoFs. In the two-user case, the 
number of DoFs with the novel precoding scheme is limited by the best accuracy of the 
CSI across the two TXs instead of the lowest with conventional ZF. 

• To improve the number of DoFs achieved with more users, we introduce a concept of 
hierarchical quantization of the CSI and we show that this leads to a dramatic improvement 
of the number of DoFs. 

• Under a total feedback constraint and with ZF schemes, we derive the number of DoFs 
maximizing allocation of the feedback bits toward each TX. 



Note that this paper serves to generalize preliminary results that were presented in [38 1. 



^Note that the selection of the set of users actually transmitting during one time slot is not considered in this work. In fact 
the formula for the number of DoFs provided in this work can be used to derive a set of transmitting TX achieving a good 
number of DoFs, i.e., to use a good combination of ZF precoding and time sharing. 
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Notations: We denote by nA(») and n^(«) the orthogonal projectors over the subspace 
spanned by the matrix A and over its orthogonal complement, respectively, i denotes the comple- 
mentary indice of i when only two users are considered, i.e., i = i mod 2+1. ||»||f designates the 
Frobenius norm while A/^(/i, a^) denotes the complex circularly symmetric Gaussian distribution 
with mean ji and variance cr^. We also denote the ith element of a vector a by {a}i and the 
{i, j)th element of a matrix A by {A}ij. Additionally, we use the notation < to denote a relation 
of order which holds true asymptotically. We also write f{x) — o{g{x)) (resp. f{x) — 0{g{x))) 
to represent the fact that lim3;^oo f{x)/g{x) = (resp. Iim3;^oo \fix)\/\9ix)\ ^ '^■> with a > 0). 
We also write f{x) ~ g{x) to denote the fact that f{x) = g{x) + o{g{x)). 

II. System Model 

A. Multicell MIMO 

We consider a joint downlink transmission from K TXs to K RXs using linear precoding and 
single user decoding. For ease of exposition, the TXs and the RXs are equipped with only one 
antenna, but the principal elements of our approach could extend in principle to more antennas 
at the TXs. Similarly, we consider a Rayleigh fading scenario but the approach derived should 
be valid in many other fading scenarios. The transmission can be described as 
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(1) 



where y = £ C^^^ contains the received signals at the RXs, the vector x = 

[xi, . . . ,xkY ^ C^^^ is defined such that Xj is the signal transmitted by TX j, and rj = 
[r]i, . . . , r]K]'^ E C^^^ contains the noise realizations at the RXs and has its entries i.i.d. as A/'(0, 1). 

The vector hf e C^^^ is the channel from all TXs to the i-th RX and define the normal- 
ized channel to user i as hi = We also define the multi-user channel matrix H = 
[hi, . . . , hx]^ and its normalized counter-part H = [hi, . . . , fiK]^- 

The channel is assumed to be block fading and the entries of the channel matrix H to be i.i.d. 
as A/^(0, 1), modeling a Rayleigh fading channel. The transmitted signal x is obtained from the 
vector of transmit symbols s = [si, . . . , ^ C^^^ (whose entries are taken as i.i.d. M{Q, 1)) 
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where T G C^^^ is the multi-user preceding matrix and tj G C^^^ is the beamforming vector 
used to transmit Sj. Even though a per-TX power constraint is the most relevant power constraint 
in the multicell setting, we consider a sum power constraint ||T||p = P. We also assume for 
simplicity and symmetry that all data streams are allocated with an equal amount of power so 
that ti = P/Kui with Hwip = 1. These choices can be done without restricting the scope 
of this work because they do not have any impact on the number of DoF^ We will study 
the ergodic rate averaged over the random codebooks used for the CSI Random Vector 



Quantization (RVQ), as detailed in Subsection |II-B[ The ergodic rate for RX i reads then as 

|2 



MP) 



log2 1 + 



(3) 



To achieve the maximal number of DoFs we aim at removing completely the interference at all 
the RXs, i.e., at having 



0. 



(4) 



K 

\/te{l,...,K}, Yl l^^^^l 

From Q and the equal power allocation, there is no coupling between the optimizations of the 
beamforming vectors which can then be carried out in parallel. The number of DoFs achieved 
at RX i is defined as 

MP) 



DoF, = lim 



P->oolog2(P)' 



(5) 



and the total number of DoFs is DoF = J2t=i DoFj. From the above definition of the number 
of DoFs and definition (|3]), we can directly obtain that Vi G {I, . . . , K}, 



DoFi = 1 - lim E , u). 



log2(P) 



(6) 



^Indeed, it is always possible to scale the total power used when considering the sum power constraint so as to fulfill the 
per-TX power constraint without impacting the number of DoFs. Similarly, optimally allocating the power does not change the 
number of DoFs. 
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B. Distributed CSI 

1 ) CSI Scaling Coefficients: We assume a limited CSI setting where channel estimate inac- 
curacies are modeled using quantized feedback. Furthermore, a distributed CSI model is defined 
here in the sense that each TX has its own individual estimate of the normalized channel hi to 
RX i. Moreover, the estimates for the different channel vectors hi are also a priori of different 
qualities at each TX, i.e., quantized with codebooks of different sizes. We denote by the 
estimate of the normalized channel vector hi acquired at TX j. The quantized feedback consists 
of bits which are used to index a vector in the codebook Wp'' made of 2^'^' elements. We 
also define H*^-'^ = [h^i \ . . . , h^x]^ as the estimate of the total normalized multi-user channel at 
TXj. 

This setting arises in the context of multi-TX cooperation (e.g. Network MIMO Q) where 
either (?) all TXs obtain a version of the whole CSI matrix through independent feedback 
channels (in which case the quality of the uplink feedback channel determines the quality of 
the individual CSI estimates) or (ii) each TX obtains some portion of the CSI and exchange it 
through limited rate links or/and with some latency to the other TXs. 

In the conventional MEMO BC, it is shown in [1] that the number of quantization bits should 
scale indefinitely with the logarithm of the SNR in order to achieve a strictly positive number of 
DoFs when using ZF precoding. Thus, we also focus on the scaling in the logarithm of the SNR 
of the number of quantization bits of all the channel estimates. We introduce the CSI scaling 
matrix cx G M^^^ with its (i, j)-th element defined as 



a'^^^ = lim — . (7) 

Hence, a\-'^ denotes the scaling of the number of bits used to describe the channel of user i 
at TX j. Since Bi''^ is a design parameter, the limit in Q can be seen to always exist. We 
furthermore assume that the CSI scaling matrix ct is known to all the TXs. 

Remark: We will always consider for notational clarity ap'' G [0, 1] as the range of interest. 
This follows from the fact that ifaf^ = 1, it then holds \hft[^'^\'^ = 0(1) for i ^ i l^. The 
accuracy of CSI resulting from a CSI scaling coefficient equal to one is sufficient for the interfer- 
ence to remain bounded. Thus, increasing the number of CSI feedback bits to get > 1 does 
not increase the number of DoFs. This corresponds to a well known result for the conventional 
MIMO BC in /?/. It follows that in all the subsequent results, the scaling coefficients a^ '' should 



B 



(i) 
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be replaced by m\n{af \ 1) so as to be valid for arbitrary values for the CSI scaling coefficients. 
This is not done to keep the notations as clear as possible. 

2) Random Vector Quantization for the DCSTMIMO Channel: We consider RVQ where 
random codebooks are used to quantize the channels. This follows a result in [1| for the 
conventional MIMO BC, stating that in the case of two antennas at the TX, no codebook can 
achieve a better number of DoFs than the number of DoFs achieved with RVQ. RVQ is also 
shown to be optimal for the point-to-point MIMO link as the number of antennas tends to infinity 



both at the TX and the RX p9| . Finally, RVQ is interesting because it gives an achievable lower 
bound. 

In most of the works regarding the conventional MIMO BC, a codeword w is selected for 
quantizing the unit-norm vector hi if it maximizes the amplitude of the inner product |. 
However, in the DCSTMIMO channel, this quantization scheme is less adequate because the 
objective is invariant by multiplication of the codeword by a unit-norm complex number. This 
represents a problem since a different estimate is received at each TX, and this phase invariance 
creates an ambiguity between the estimates. This is very harmful for the transmission scheme 
and, in fact, if such a quantization scheme is used, it can be easily shown that the channel 
estimate obtained is essentially useless for joint precoding. 

Thus, another quantization scheme is preferred and the quantized channel h.p-' is instead 
obtained in the optimum L2 norm sense: 

h\-'^ = argmin — (8) 

Using directly ([8]) leads to lower performance as the phase of the channel also impacts the 
performance, and not only the direction in a Grassmannian space. To recover similar performance 
as the quantization scheme conventionally used, we multiply all the elements of the codebook 
as well as all the normalized channels by a complex number so as to let their first coefficient 



be real valued. A detailed analysis of this quantization scheme is provided in Appendix X-A 



C. Distributed Precoding 

In the DCSI-MIMO channel, each TX has a different estimate of the multi-user channel H 
and controls only one antenna. Thus, each TX uses its CSI to compute a certain precoding 
matrix from which it extracts the coefficient corresponding to its antenna. We denote the overall 
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where t'f'^ is the beamforming 



muki-user precoder computed at TX j as T*^-'^ = t\ 
vector designed to transmit symbol Sj. 

Note that although a given TX j may compute the whole precoding matrix T^^\ only the 
j-th row ejT*^-'^ will be used in practice, since TX j transmits only Xj = e^T^^^s. Finally, the 
effective precoder is then given by 



eTT(2) 



-K 



(9) 



The main elements of the transmission in the distributed CSI MIMO channel are illustrated in 
Fig-E 
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III. Review of the Results in the Conventional MIMO BC 

In this section, we recall briefly the main results from [[T| on the number of DoFs achieved 
with finite rate feedback in the conventional MIMO BC. This will be helpful to understand the 
differences between the conventional MIMO BC and the distributed CSI setting which is the 
main focus of this work. 

Hence, we consider in this section a conventional MIMO BC where M TXs are colocated and 
share the same channel estimate. For this setting, we need to use different notations as previously 
introduced for the DCSI-MIMO channel. We denote by hi the channel estimate of hi obtained 
with Bi bits. Following [1], the channel estimate is obtained from 

hi = argmax \w^hi\'^ (10) 

where Wf*^ is a random codebook containing 2^* unit-norm vectors isotropically distributed 
in C^^^ We provide now the main result. 

Theorem 1. fl^] In the MIMO BC with M antennas, if the channel estimate hi is obtained from 



the quantization scheme (\Q) with Bi = ai{M — 1) log2(P), the number of DoFs achieved with 
ZF is given by 

M 

DoF^^ = ^ai. (11) 



i=l 



This result was given in [[T| for = a but the extension to different ctj follows directly from 
the proof in [ 1 1. The extension to Theorem [T] has been suggested in |40| where the same formula 
for the number of DoFs is derived in the case where DPC is used instead of ZF. 

We will now derive the equivalent result of Theorem [T] for the DCSI-MIMO channel where 
the TXs do not share the same channel estimates. 



IV. Zero Forcing in the DCSI-MIMO Channel with Two Users 

As a starting point we consider the particular configuration with only two users. This setting 
is interesting for two main reasons. Firstly, the exposition is simpler in that case while most of 
the insights are the same as in the general case, and secondly this scenario makes it possible to 
obtain stronger results. 
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In the conventional multiple- antenna BC with imperfect CSI, the number of DoFs with ZF 
has been derived and shown to be defined by the CSI scaling. In the DCSI-MIMO channel, the 
CSI scaling of each channel vector hi is different at each TX. One central goal of our work 
consist in determining how the formula for the number of DoFs in the conventional MIMO BC 
generalizes to the DCSI-MIMO channel. This would then lead us to evaluate whether ZF is in 
that case a performing solution and if not, whether one can find better solutions. 

A. Conventional Zero Forcing 

In the DCSI-MIMO channel, the conventional ZF precoder is made of the beamformer ff"^ = 
[e^t^^^*^^"*, e|t^^^^^^]^ to transmit Sj, with its elements defined in an intuitive way as 



The interpretation behind conventional ZF is that each TX applies ZF using its own CSI implicitly 
assuming that the other TX shares the same CSI estimate. Our first result given in the following 
theorem relates the number of DoFs achieved with such a precoding strategy. 

Theorem 2. Conventional ZF achieves the number of DoFs 

DoF"^^ = 2 min a^/\ (13) 

ij6{l,2} 



Proof: A detailed proof is provided in Appendix X-B 



We can observe that in the case of distributed CSI, the number of DoFs is limited by the 
worst quality of the CSI across the channels to the RXs and across the TXs. Comparing this 
result with the number of DoFs achieved in a conventional MIMO BC given in Theorem [T| it 
is remarkable that the number of DoFs at both users is limited by the worst estimation error 
whether it is done relative to hi or h.2. This is contrast to the formula for the conventional 



MIMO BC in ( [T3| ) where the accuracy of the estimation of hi impacts only the number of DoFs 
of RX i. 

Note that when all the CSI scaling coefficients are equal, the setting considered is still different 
from the conventional multiple- antenna BC. Indeed, the estimates at the different TXs have 
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Statistically the same accuracy since the CSI scaling coefficients are equal, but the realizations 
of the estimation errors are still different. 

One can conclude from Theorem |2] that the additional interference due to the CSI inconsistency 
between the TXs does not lead to any loss in number of DoFs compared to the conventional 
multiple-antenna BC if and only if the channel estimates are of the same quality. 



B. Robust Zero Forcing 

Robust precoding schemes have been derived in the literature either as statistical robust ZF 
precoder or precoder optimizing the worst case performance to reduce the harmful effect of 
the imperfect CSI. Since we consider the average sum rate, the most relevant approach is the 
statistical one. Thus, we model the quantization error at TX j by an additive white Gaussian 

noise A'^-'^ = [^^•'^ ^g"*^]^ of variance equal to P""^^' for the estimation error (Jp-* resulting from 

_ (j) 

the quantization of hi at TX j. The variance P is obtained from the analysis of the scaling 



of the estimation error which is given in Appendix X-A 



The covariance matrix of the estimation error at TX j is then R^-* = E[A*^-'^(A(-'^)^] = 

(i) (i) J I 

diag([P~"i , P~"2 ]). Using this model, we can extend the approach from [17| and the beam- 
former transmitting symbol Si at TX j is obtained from solving the following minimization: 

P 

argminE^U)[||ei - H^-^'^tif ], subject to ||t,f = — . (14) 

Writing the Lagrangian of the minimization problem with the Lagrange variable A for the power 
constraint and taking the derivative according to t* yields the equation 

(n'i! + h(^)«H(^) + Al) ti - H(^)"e,; = 0. (15) 

The factor A improves the performance at intermediate SNR by striking a compromise between 
the orthogonality constraint and the power consumption but it cannot improve the number 
of DoFs. Thus, we can let A be equal to zero and normalize the beamformer to fulfill the 
power constraint. The robust ZF beamformer transmitting symbol Sj is denoted by t-^^ = 
[eJtr''\eltr^'^V andVjG{l,2} 

^rzm A ^ ^/^ ^^(Ri^ + HO-)HHO-))-^HO-)He. ^^^^ 
Fl^^ + HO-)HH(j))-iHO-)Hei 

We then derive the number of DoFs achieved by this robust precoder. 
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Proposition 1. The robust ZF precoder defined in ([16]) achieves the same number of DoFs as 
conventional ZF. 



Proof: Considering strictly positive CSI scaling coefficients, the variances of the estimation 



errors tend to zero so that the inverse term in ( fT6l ) can be approximated and we can write at 
RX ~r. 



K 



r(() + HO-)HHO-))-iHO-)Hei 
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^H(HO-))-i(i-(HO-)H)-iRa)(HO-))-i)e.p 

— 2 ^ oU|Ra IIf 

^rW + HO-)HHO-))-iHO-)Hei 



) 

) 



(17) 



(18) 



(19) 



The difference with conventional ZF is the term (H^-'^^) ^R^''(H'^''^) ^ which can be shown 
to lead to no reduction of the interference and introduces actually an additional error term. 

diag([p-"i'',p-"2'']). This is also the 



Yet, it converges to zero as P min("?'."2^') since R^-* 



rate at which the remaining interference tends to zero when using conventional ZF. Thus, the 
regularizing term vanishes and the number of DoFs achieved is the same as conventional ZF. ■ 
Hence, even the existing designs of robust ZF precoders do not improve the number of DoFs 
in the DCSI-MIMO channel. Note that the extension of the definition of the statistical robust 
precoder as well as the extension of proposition [T] to the general setting with K users is trivial 
and will not be given explicitly. 



C. Beacon Zero Forcing 

Robust ZF schemes from the literature do not bring any DoFs improvement which leads to 
investigate other alternative schemes more adapted to the DCSI-MIMO channel. As a result, 
we now propose a modification of the conventional ZF scheme which improves the number of 
DoFs when the estimates for h\ and h,2 are of different qualities. We call it Beacon ZF (bZF) 
because it makes use of an arbitrary channel-independent vector known beforehand at both TXs 
(a beacon signal). 
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The beamformer used to transmit symbol Sj is then t^^^ = [ejt^'^^^^\ e'lt"'^^^'^^]'^ , with its 
elements defined from ^ 

2||n^,,(c.)|| ^'"^ 

i 

where Cj is any non-zero vector chosen beforehand and known at the TXs. Due to the isotropy 
of the channel, the choice of Cj does not influence the performance of the precoder. 

Corollary 1. The number of DoFs achieved with beacon ZF is 

DoF''^^ = min a\^^ + min a^^\ (21) 
ie{i,2} ie{i,2} 



Proof: The number of DoFs follows easily from Theorem |2] Indeed, when using beacon 
ZF, no error is induced by the projection of the direct channel which is replaced by a fixed 
given vector. In terms of number of DoFs, there is no difference between projecting the direct 
channel or any given vector. Thus, it is possible to apply the formula for the number of DoFs 
in Theorem |2] considering that the direct channel is perfectly known, which yields the result. ■ 
The key idea behind beacon ZF is to reduce the impact of the differences in CSI quality by 
using only the CSI necessary to fulfill the orthogonality constraint. Thus, the direct channel, 
which does not change the number of DoFs but only improves the finite SNR performance, is 
not used. It follows then that does no depend on the estimates of hi, and symmetrically 
does not depend on the estimates of ^2- 

D. Active -Passive Zero Forcing 

Beacon ZF improves the number of DoFs but it is still the worst CSI scaling across the TXs 
(although no longer across the RXs) which defines the number of DoFs. To improve further the 
number of DoFs, we propose a scheme called Active-Passive Zero Forcing (AP ZF). Assuming 
w.l.o.g. that a^^^ > a^^\ AP ZF consists in the precoder whose beamformer tf^^^ transmitting 
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symbol Sj is given by 



t 



APZF A 



P 



21og2(P) 



1 



L {hT'h. 



{2)^ 



21og2(P) 



.APZF 



(22) 



(23) 



where 



.APZF A 



(24) 



andpr4|{/.f)},|V|{^f},P. 

AP ZF is based on the idea that each beamforming vector has to fulfill only one orthogonality 
constraint so that only one available variable is necessary. Thus, one coefficient can be set to a 
constant while still fulfilling the ZF constraints. Moreover, the only way to achieve the number 
of DoFs stemming from the best CSI estimate is if TX 2 (which has the best knowledge of hi) 
can adapt to the coefficient transmitted at TX 1 to adjust its beamforming vector and improves 
the accuracy with which the interference are suppressed. This is possible only if TX 2 knows 
the transmit coefficient at TX 1. 

Using this precoding scheme, the number of DoFs is then given in the following proposition. 



Proposition 2. Active-Passive ZF achieves the number of DoFs: 



DoF^P^^ > max + max a'i 



(i) 



J6[l,2] 



(25) 



Proof: By symmetry, we consider w.l.o.g. the number of DoFs at RX 1, and we assume 
that the beamformers ti and ^2 are given by ([23]). We still assume w.l.o.g. that af^ > ^ i.e., 
TX 2 has the best CSI over hi. From (|6]), the number of DoFs at RX 1 is 

Eh,{W.,,} [iog2(i^?t2r 



DoFi= 1 

We now focus on the interference term: 

\h%\ 



lim 

P->oo 



l0g2(P) 



P 



21og2(P) 





1 




hf 













(26) 



(27) 



ACCEPTED FOR PUBLICATION IN IEEE TRANSACTIONS ON INFORMATION THEORY. 



17 



(2) 

By construction, ±2 is orthogonal to h\', so that 



21og2(P) 
21og2(P) 



.(2)H 



(28) 
(29) 



Inserting p9| ) in the DoFs expression ([26]) and using Proposition 11 from Appendix X-A to 
bound the expectation of the sinus, we obtain 



E- 



DoFi > lim 



> lim 



H,{W,,j} 



-log2 sin {huh'^ 



(2)^ 



l0g2(P) 



(2) 



P^^ log2(P) 



(2) 



(30) 

(31) 
(32) 



which is the best scaling across the TXs. ■ 
Comparing the number of DoFs achieved with AP ZF with the number of DoFs achieved when 
both TXs share the estimate of a channel vector with the highest accuracy gives the following 
result. 

Theorem 3. Active -Passive ZF achieves the same number of DoFs in the 2-user DCSI MIMO 
channel as in the conventional MIMO BC where both TXs share the estimates with the highest 
CSI accuracy. 

Improved scheme at finite SNR: AP ZF allows to recover the number of DoFs which would 
have been achieved with the best CSI across the TXs. However, the choice of the coefficient 
used to transmit at TX 1 (with the lowest accuracy of the CSI) remains to be discussed. In 
fact, the beamformer can be multiplied arbitrarily by any unit-norm complex number without 
impacting the rate achieved so that only the power used at TX 1 needs to be decided. According 



to (|23]), the power used at TX 1 is set to P/(2 log2(P)). 

The normalization by log2(P) is done because the fading coefficient {^1)2 rnight have a very 
small amplitude. In this case it would be necessary for TX 2 to transmit with a very large power 
to fulfill the orthogonality constraint. To ensure that the interference are canceled for all channel 
realizations while respecting the power constraint, it is necessary to have the ratio between the 
power used at TX 1 and the sum power constraint tending to zero. The factor log2(P) is used 
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because it fulfills this property while not reducing the number of DoFs due to the partial power 
consumption. 

However, this comes at the cost of using only a small share of the available power, which 
is clearly inefficient and leads to a rate offset tending to minus infinity. To avoid this behavior, 
we propose that the TX with the worst CSI accuracy adapts its power consumption with respect 
to the channel realizations. In the following, we propose two possible solutions to improve the 
performance at finite SNR: 

• Firstly, TX 1 can use its local CSI to normalize the beamformer which is then given by 



with pf^ = |{/i^^^}ip/|{/i^''^}2p, for j — 1,2. This beamformer is not DoFs maximizing 
because the local CSI is used at TX 1 so that TX 2 does not any longer have an exact 
knowledge of the coefficient used to transmit at TX 1. Consequently, beamformer tf^^^ 
is not any longer orthogonal to hj . Yet, this solution achieves good performance at 
intermediate SNR. 

• Another possibility is to assume that TX 1 receives the scalar pf^ (or pi) and use it to 
control its power. This means that TX 2 needs to share this scalar. This requires an additional 
feedback, but only a few bits are necessary to improve the performance at practical SNR. 

V. Zero Forcing in the DCSI-MIMO Channel for Arbitrary Number of Users 

In this section, we will show how the main results can be generalized to arbitrary number 
of users. The same approach as in the case K = 2 can be followed and we start by briefly 
generalizing to arbitrary number of users the precoding schemes previously described. 

A. Conventional Zero Forcing 



The conventional ZF precoder will be denoted as T*^^^ = [^i^^, ■ ■ ■ , t^^] with t'f'^ = 
[e[tf'^'^^\ e\tf'^'^'^\ . . . ^ e\tf'^'^^^Y transmitting symbol Sj, and the beamformer tf'^^^'' com- 



1 




(33) 



puted at TX j to transmit symbol i given by 




(34) 
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We can then generalize the results from Theorem |2] to an arbitrary number of users. 

Theorem 4. In the DCSI-MIMO channel, the number of DoFs achieved with conventional ZF 
is equal to 

DoF^^^ = K min (35) 
Me{i,...,x} 



Proof: A detailed proof is provided in Appendix X-B ■ 
In Theorem |4} we have shown that the results concerning conventional ZF can be exactly 
generalized and the number of DoFs scales with the worst CSI accuracy across the TXs and 
the RXs. Indeed, the bad estimation of the channel to one user at one TX reduces the number 
of DoFs of all the users. This is very pessimistic and represents a different behavior as in the 
conventional multiple-antennas BC. This can be observed by comparing the number of DoFs 



for the conventional MIMO BC in (11) with the formula for the number of DoFs in the DCSI- 



MIMO channel given in ([35]) when = 1, . . . ^K^af^ = ai, i.e., the CSI qualities are the 



same at all the TXs 

B. Beacon Zero Forcing 

The beacon ZF precoder is denoted as T^^^ = [t\'^^ ,±2'^^ . . . ,t^^] with the beamformer 
t^^^ — [elt^'^^^^\e2t^'^^^'^\ . . . ,e\-t^'^^^^^Y- transmitting symbol Sj. The beamformer t°'^^^^'^ 
computed at TX j to transmit symbol Sj is given by 

i 

where Cj is any non-zero vector chosen beforehand and known at all TXs. 
Proposition 3. The number of DoFs achieved with beacon ZF is equal to 

K 

DoF^^^ = V min min (37) 

j-^ iG{i,...,K}, e,je{i,...,K}, 



Proof: To derive the number of DoFs at a RX k, we need to compute the scaling of the 
interference at RX k stemming from the transmission to the K — 1 other RXs. In the proof 
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of Theorem |4} it is in fact the scaling of the interference resulting from the transmission of 
one stream which is calculated. To obtain the number of DoFs at one RX, the scaling of the 
interference resulting from the transmission of each of the K — 1 interfering streams needs to 
be computed. This is represented by the first summation over i. Determining the interference 
leaked by the transmission of symbol Si using beacon ZF leads to the second minimum in the 
formula. ■ 
We have derived the number of DoFs for beacon ZF, but we will show in the following 
corollary that beacon ZF is only attractive in terms of number of DoFs in the two-user case. 

Corollary 2. For K > 3, beacon ZF achieves the same number of DoFs as conventional ZF. 

Proof: The result is easily obtained by studying the effect of the two successive minimums 
in dSTl). ■ 



C. Active-Passive Zero Forcing 

The generalization of AP ZF is intuitive and consists simply, for the computation of each 
beamforming vector, in letting one TX arbitrarily fix its precoding coefficient while the other 
TXs adapt to this coefficient. Nevertheless, it requires the introduction of a few more notations. 

We define the ordered set § = {rii, . . . , hk} as the set whose i-th element corresponds to the 
indice of the TX with fixed coefficient when transmitting the symbol Sj (passive TX for Sj). We 
then introduce the (column) channel vector from TX ^ to all the RXs except the i-th RX: 

^p)(£) ^ [{H(^)}i,,, . . . , {H^^')}.+i,., . . . , {H(^)};,,,]'^. (38) 

Using the previous definition, we can then define 

^'^ind = [^FHl), • • -^g^in, - + !),•• ■,9l'\K)] (39) 

which represents the estimate at TX j of the multi-user channel from all the TXs except TX Ui 
to all the RXs except RX i. 

For a given set §, we write T^^^^{S) = [tf^^^{ni),tf^^{n2), . . . ,t^^^^{nK)] where the 
beamformer tf^^^(r2j) = [e]-tf^^^^^^(r2j), e|tf^^^^^^(nj), . . . , e^tf^^^^^^(nj)]^ transmits sym- 
bol Si. The beamformer tf^'^^^-'\ni) computed at TX j to transmit symbol Sj is given by 



ACCEPTED FOR PUBLICATION IN IEEE TRANSACTIONS ON INFORMATION THEORY. 



21 



where we have defined 

APZF(i)/ ^ .APZF(j)/ ^ . APZF{j) / ^ . . APZF(i) / ^ „>APZF(i) 



li 



(r^O, • • • , C-i7i^0. • • • , (41) 



T 



G C^-i and 



,APZFO)/ N A 

With tt- n,- = 



nf^^^^V.) ^ , - (42) 

Even though the notations are quite heavy, the intuition behind the construction of the precoder 
is exactly the same as for the two-user case. TX rii is the passive TX and transmits with a fixed 



coefficient ^JP jK log2(P) while the other active TXs then choose their coefficients in order 



to ZF the interference. This is obtained by setting their coefficients so as to fulfill ( |40| ). The 
notational complexity comes only from the fact that we need to introduce a "reduced" channel 
without the direct channel as well as without the channel from the passive TX. 

Proposition 4. Active-Passive ZF with the set § = {^i, . . . ,nx} achieves the number of DoFs 

K 

DoF'^P^^rS) = V min min a^p . (43) 

f:^iG{l,...,i^}, i,3&{l,...,K}, 



Proof: Due to the symmetry between the RXs, we will show the result only for the number 
of DoFs at RX k. Let assume that AP ZF is used with the set §. To obtain the number of 
DoFs, we need to derive the scaling of the interference at RX i when all streams are transmitted 
using AP ZF. The first minimum of the DoFs formula follows from the summation over all the 
K — 1 interfering streams. It remains then to determine the scaling of the interference resulting 
from the transmission of one given data symbol. 

TX j computes the beamformer tf^'^^^''\ni) according to ( |40l ). This formula is similar to the 
one for conventional ZF so that the scaling of the remaining interference power can be derived 
with a proof very akin to that of Theorem |4] which is omitted to avoid repetitions. Thus, the 
interference received at RX k due to the transmission of symbol Si corresponds to the second 
minimum of the DoFs formula. This expression follows from the fact that the CSI at TX and 
the CSI on the direct channel are not used to design the beamformer transmitting S£. ■ 
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The number of DoFs given in Proposition |4] is given by two successive minimizations. This is 
similar to beacon ZF at the difference that the index of one TX is not taken into account in the 
second minimization. This leads then to a larger number of DoFs. The formula for the number 
of DoFs depends on the set § but we will show that the optimal set is easily derived when the 
number of users is larger than 4. 

Corollary 3. For K > A users, it is optimal in terms of number of DoFs to choose all the 
indices in § to be equal. Therefore, it is optimal to choose Ui as the indice of the minimum over 
all the CSI scaling coefficients, and the number of DoFs reads as 

DoF^P^^ = K min (44) 

i,jG{l,...,A'}, 

(fc) 

jT^argmin^ min^ 



Proof Similar to the proof of the corollary for Beacon ZF, the proof follows by studying 
the effect of the two successive minimums and for K > A, ii has for consequence that it is 
optimal to choose yi,j,ni = nj. ■ 
Exactly as in the two-user case, AP ZF leads to an improvement in number of DoFs but 
this comes at the cost of an unbounded negative rate offset. To improve on this feature, the 
percentage of the available power which is consumed by the TXs needs to be increased. The 



sames solutions as described for the two-user case in Subsection IV-D can be applied, i.e., either 
a heuristic power control or the transmission of a scalar to control the power. Note that the scalar 
can be transmitted by any of the other K — 1 TXs and that one scalar needs to be transmitted 
for each stream. We refer to Subsection IIV-DI for more details. 



D. Discussion of the Results 

Altogether, we have shown in this section that the results for the two-user case given in 



Section IV could generalize to an arbitrary number of users. However, the results suggest in all 
cases a fundamental lack of robustness of the performance as we increase the number of users. 
Indeed, with conventional ZF, a single inaccurate channel estimate can reduce the number of 
DoFs of all the users while the novel precoding schemes proposed can only cope with a few 
channel estimates being of insufficient quality. This shows the need for other methods to make 
the transmission more robust to imperfect distributed CSI when more than two-user are present. 
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VI. Precoding Using Hierarchical Quantization 

In view of the rather pessimistic results in the previous section, we propose now an alternative 
method to make the transmission more robust to the CSI discrepancies. It consists in modifying 
the CSI quantization and using a Hierarchical Quantization (HQ) scheme to encode the CSI [ 134| , 

EH- 

A. Hierarchical Quantization 

Hierarchical quantization (or multi-resolution quantization) is a quantization scheme in which 
the information is encoded so that the original message can be decoded up to a number of bits 
depending on the quality of the feedback channel. The better the channel is, the more bits can 
be decoded. Thus, if one entity receives a codeword with a higher accuracy than another entity, 
and has the knowledge of the feedback qualities, it also knows what has been decoded at the 
other entity. Conversely, if one entity can detect the feedback information at a given resolution 
level but knows that another entity can decode the same information at a higher resolution level, 
it can use its individual decoded codeword to form a limited set of guesses around it as to which 
higher resolution codeword may have been detected at the other TX. 

In our setting, it means that each TX can decode the CSI feedback up to a certain number 
of bits depending on the quality of the feedback link. If TX ji receives a CSI of better quality 
than another TX j2, it can decode more bits from the CSI and can get the knowledge of the CSI 
at TX j2 with less decoded bits. Note that this implies that two TXs with the same CSI quality 
have the same codebook and thus exactly the same realization for the channel estimation error. 
This is in contrast to what has been considered in the previous sections. 

We wish to continue using the properties of RVQ so that we need to design hierarchical 
random codebooks, i.e., codebooks fulfilling the properties of both kinds of codebooks. Since 
this is not the main focus of the work, we just briefly describe a possible method to construct 
such codebooks and the quantization scheme associated. 

We start by considering a random codebook of size corresponding to the best accuracy, say 
2^max jj^jg random codebook is then divided into two random codebooks containing each half 
the elements. This process is then applied on the two smaller codebooks obtained until having 
2^max codebooks of one element. In each of the sub-codebooks of different sizes created, we 
pick randomly one elements to be the representative of this codebook. 
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Once the quantized vector maximizing the figure of merit has been chosen among the l -^^-^ 
vectors, the encoding can be easily done. The chosen vector belongs to one set of each size 
and the encoding bits are used to select among the two possible choices, the set to which the 
quantized vector belongs. 

The decoding step works as follows. The first bit denotes one of the two codebooks of size 
2^max-i^ the second bit denotes one of the two codebooks of size 2^™=''="^ inside this codebook, 
and so on, until the last bit is decoded. Once this is done, the codeword decoded is chosen to 
be the representative codeword of the obtained codebook. 

It is then easily verified that the proposed quantization scheme has the hierarchical properties 
desired. 

B. Conventional Zero Forcing with Hierarchical Quantization 

In the previous sections, we have shown that the quality of the estimation of one channel hi 
to one given RX had an impact on the number of DoFs achieved at all RXs. This is a surprising 
property which follows from the particular structure of the DCSI-MIMO channel where the 
consistency between the transmissions of the different TXs is critical. We will show how the 
hierarchical quantization described above can be used to avoid this very inefficient property. 

In the following, we will consider a particularly simple use of hierarchical quantization 
consisting in letting all the TXs designing the beamforming vector use only the part of the CSI 
which is common to all the TXs, and simply "forget" about the more accurate CSI knowledge. 
We then obtain a CSI configuration where all the TXs share the same CSI and the number of 
DoFs can be obtained from Theorem [T] 

Theorem 5. The number of DoFs achieved using Conventional ZF with hierarchical quantization 



IS 



K 



DoF' 



.cZF 




(45) 



1=1 



Using HQ as described, i.e., using only the estimate of a channel vector hi common to all 
the TXs, follows from the observation that the worst estimation error of hi limits in any case 
the number of DoFs at RX i. Thus, using only the common part of the estimate of hi does 
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not reduce the number of DoFs at RX hi. Yet, it leads to an improved consistency between the 
beamformers computed at the TXs. This has for consequence that the error in the estimate of 
the channel hi only impacts the number of DoFs at RX i and not at the other RXs. 

Note that the proposed scheme using HQ is very simple and more gains could certainly be 
obtained with a more sophisticated use of the additional CSI knowledge available at some TXs. 

C. Active-Passive Zero Forcing with Hierarchical Quantization 

Hierarchical quantization is used for AP ZF in the same way as for Conventional ZF. This 
consists in using the CSI which is common to all the active TXs considered in the definition of 



the beamformer in (40). 



Proposition 5. The number of DoFs achieved using Active-Passive ZF with Hierarchical Quan- 
tization and the set § is 

K 

DoF^P^^(§) = V min min a^l (46) 



The two successive minimums come from the fact that it is not the same TX which is passive 
for the different streams. It is clear from (|46]) that it is optimal to choose all the Ui to be equal 
for K > 3. However, the indice of the optimal passive TX, which we denote by nnq, is now 
different from the case without HQ. It is easily obtained by looking for the passive TX bringing 
the largest improvement in number of DoFs: 

K 

TT-HQ — argmax min al^\ (47) 

ne{i,...,it} r:^ ie{i.--'^}' 

h — J- JT^n 

The maximum number of DoFs using AP-ZF with HQ follows then directly. 

Proposition 6. For K > 3, it is optimal to choose the passive TX to be TX j with j = tt-hq 
defined in ( [47] ), for all the data streams. The number of DoFs achieved with Active -Passive ZF 
based on Hierarchical Quantization is then equal to 

K 

j3^pAPZF^y min (48) 

3&{l,...,K}, 
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VII. DoF Optimal Sharing of the Feedback under a Total Feedback Constraint 

In this section, we consider the opposite side of the problem which consists in deriving how 
to distribute a maximum number B of feedback bits across the TXs and the channel vectors 
so as to maximize the number of DoFs. Since our focus remains on the number of DoFs and 
considering previous results, it is meaningful to introduce 7 = limp_>oo 5/ log2(P) which we 
call the total feedback scaling. 

Thus, we consider a constraint on the sum of the scaling coefficients of the total feedback 
transmitted through the multi-user channel feedback: 



We study first conventional ZF before extending the results to Active-Passive ZF. To optimize 
the CSI allocation efficiently, it becomes necessary to also optimize the number of users being 
served, which means that time sharing will this time be explicitly considered. 

A. Conventional Zero Forcing 

Proposition 7. With conventional ZF (with or without Hierarchical Quantization), it is optimal 
in terms of number of DoFs to share equally the number of bits across the TXs and across the 
channels to quantize and to let the number of TX being actually transmitting be equal to n for 
7 G [n(n — 1)^, (n + l)n^]. It follows that the optimal number of DoFs using Conventional ZF 
is equal to 



Proof: We study first the case without HQ. Since the number of DoFs scales as the worst 
CSI scaling across the TXs and the channel vectors, it is clearly optimal to have the same CSI 
accuracy at all the TXs and for all the channel vectors. To achieve a number of DoFs of a at 
n RXs, the number of bits to quantize a channel vector has to be equal to a{n — 1) log2(-P), 
where n is the number of transmitting TXs. Hence, the total feedback in the channel is given 
by n'^a{n — 1) log2(P) when considering the n estimates needed at the n TXs. 





(49) 



i,je{i,...,K} 




(50) 
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Let's assume that n TXs are serving n RXs with the maximal feedback scaling 7, we obtain 
that a = 7/(n^(r2 — 1)). For 7 < n'^{n — l) the number of DoFs achieved at the RXs is lower or 
equal to one so that the sum number of DoFs is equal to na = 'j/(n(n — 1)). For 7 > — 1), 
the number of DoFs at each RX reaches its maximal value of one and the sum number of DoFs 
is equal to n. Comparing the sum number of DoFs achieved by two successive configurations, 
with respectively n and n + 1 users served, leads to the value of 7 given in the proposition as 
switching point between the configurations. 

When HQ is used, the number of DoFs still scales as the minimum over the CSI scaling 
across the TXs so that it is still optimal to let all the TXs have the same CSI scaling. ■ 

Using HQ does not increase the number of DoFs when the CSI configuration can be optimized. 
However, many more configurations are optimal as the CSI can be allocated indifferently to any 
channel vector as long as the scaling of the CSI does not exceed one and all the TXs receive 
the same CSI. 

The results from Proposition [7] are very intuitive, yet the formula is not very enlightening and 
the intuition is better understood in a plot of the number of DoFs with optimal CSI sharing. 
Thus, we plot in Figure [2] the number of DoFs in terms of the total feedback scaling 7 for 
different numbers of transmitting TXs. The parts with a positive slope correspond to values of 
a smaller than one while the flat parts correspond to a saturation of the number of DoFs, i.e., 
a>l. 

The values of 7 corresponding to the saturation of the number of DoFs and to the activation 



of an additional user, respectively, are given in Appendix X-C When n TXs are transmitting, the 
slope of the number of DoFs as a function of 7 is known to be equal to l/(n^(n — 1)) and we 
can observe in the figure how the values for 7 given in the proposition fit with the observation 
in terms of saturation and intersection of the curves. 

It is possible to observe that the saturated parts are optimal for some values of 7. This follows 
from the fact that using an additional TX induces an increase of the feedback necessary (lower 
slope in the figure). Thus, a possibly large increase in 7 is necessary before reaching the point 
where it starts being more interesting to serve the additional RX and use an additional TX. 
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Fig. 2. Degrees of Freedom as a function of the total feedback scaling 7 for different number of users. 

B. Extension to Active-Passive Zero Forcing 

Our analysis for conventional ZF can be extended to Active-Passive ZF without difficulty. The 
only difference consists in the number of bits necessary to achieve a scaling of a which is then 
n{n — l)^Q;log2(P) instead of — l)Q;log2(P) since one TX (passive TX) does not need to 
be shared any CSI. Thus, it holds that a — ^ / {n{n — 1)"^) which leads to the following result. 

Proposition 8. When using Active-Passive ZF (with or without HQ), it is optimal to share 
equally the number of bits across the active TXs and across the channels, and to let the number 
of transmitting TXs be equal to n for 7 e [n(n — 1)^, (n + l)n^]. It follows that the optimal 
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number of DoFs is equal to 

DoF^P^^ = 7/(ri - l)\ if-fE [{n - lf,n{n - if] 

(5 1 ) 

DoF^P^^ = n, if-f e [n{n - 1)2, n^]. 

The proof and the plot of the number of DoFs in terms of the total feedback scaling 7 follow 
both the same pattern as conventional ZF and are omitted to avoid repetition. 

The general insight behind those results is that it is better to achieve the maximal number 
of DoFs at less users instead of serving more users with a lower number of DoFs. This is an 
intuitive consequence of the very quick increase of the size of the aggregate feedback required 
in terms of the number of TXs used. 

VIII. Simulations 

A. In the Two-User Case 

We consider two models for the imperfect channel CSI, a statistical model and RVQ. 
In the statistical model, the quantization error is modeled by adding a Gaussian i.i.d. quantiza- 
tion noise to the channel with the covariance matrix at TX j equal to diag([P~"i , P~°2 ]). This 



corresponds to the scaling in P of the variance provided in Proposition [TO] of Appendix |X-A 
The Gaussian distribution maximizes the entropy for the given variance [j42| so that we will 
obtain a priori a lower bound for the performance. Yet, it is expected that only the scaling of 
the variance will have an impact so that the statistical model should be accurate. The averaging 
is then done over 10000 realizations. 

In the RVQ, we consider a given number of feedback bits and we average over 100 random 
codebooks and 1000 channel realizations. In the simulations, we consider the following precoders: 



ZF with perfect CSI, conventional ZF [cf. (12)], Beacon ZF [cf. (20)], and Active-Passive ZF 



[cf. ( [23] )] with heuristic power control and with 3-bits power control. 

In Fig. [sj we consider the statistical model with the CSI scaling [aj^^'', a^^"*] = [1,0.5] and 
[02^'', ^2^^] = [0,0.7]. To emphasize the number of DoFs (i.e., the slope of the curve in the 
figure), we let the SNR grow large. As expected theoretically, conventional ZF scales with the 
worst accuracy and saturates at high SNR, while Beacon ZF has a positive slope and Active- 
Passive ZF performs closer to perfect ZF with a slope only slightly smaller than the optimal 
one. 




Fig. 3. Sum rate in terms of the SNR with a statistical modeling of the error from RVQ using [a^^',a^^'] — [1,0.5] and 
[4^\Qf] = [0,0.7]. 



In Fig. 4| we plot the sum rate achieved with the CSI feedback [b[^\b^^^] = [6,3] and 
[B^\b^''] = [3,6] using RVQ. From the theoretical analysis, the number of DoFs should be 
equal to zero for all the precoding schemes since the number of feedback bits used does not 
increase with the SNR. This is confirmed by the saturation of the sum rate as the SNR increases. 
Yet, the saturation occurs at a higher SNR for Beacon ZF compared to conventional ZF, and at 
an even higher SNR for Active-Passive ZF. This translates into an improvement of the sum rate 
at intermediate SNR. 




B. With Arbitrary Number of Users 

For the simulations with arbitrary number of users, only the statistical model described in the 
previous paragraph for the two-user case is considered. To model easily the use of Hierarchical 
Quantization, we simply consider that a TX has the knowledge of the channel estimate at another 
TX if this TX receives a feedback concerning this channel vector with a lower CSI scaling 



coefficient. Since we have derived that Beacon ZF [Cf. (36)] does not bring any improvement 



in number of DoFs for K > 3, we will consider in the figures only conventional ZF [Cf. (34)] 



and Active-Passive ZF [Cf. (|40])] where the transmission of 3-bits to the passive TX is allowed 
for every beamforming vector. For both precoding schemes, we will furthermore consider both 
the case of Hierarchical Quantization with random codebooks and conventional RVQ. 
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SNR [dB] 



Fig. 5. Sum rate achieved for thie arbitrarily chosen CSI scahng configuration a given in Appendix |X-D| 



We consider the performance achieved with an arbitrary chosen CSI scaling matrix to verify 
that the precoding schemes behave as expected. Thus, we consider K = 7 users and we set 
all the elements of the CSI scaling matrix a equal to 1 at the exception of two coefficients 
corresponding to different TXs and RXs set to and 0.3, respectively. The CSI scaling matrix 



is given explicitly in Appendix X-D as well as the number of DoFs obtained analytically for 
that setting. 

In Fig. |5} we plot the average sum rate achieved for the previous setting in terms of the SNR. 
We can observe that the schemes using HQ achieve a much larger number of DoFs (i.e., slope in 
terms of the SNR) which is in agreement with the theoretical results. Furthermore, the increase 
in number of DoFs translates to better performance at intermediate SNRs. 
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IX. Conclusion 

In this work, we have introduced a new model, called distributed CSI-MIMO channel, con- 
sisting in a multicell downlink channel where each transmitter has its own local estimate of the 
whole multi-user channel. We have shown that conventional ZF precoding applied without taking 
into account the CSI discrepancies achieves far from the maximal number of DoFs and is limited 
by the worst accuracy of the CSI over the whole multi-user channel. This is particularly striking 
as the bad estimate of the channel to one particular user at a unique TX reduces the number 
of DoFs of all the users. This represents a different behavior from the conventional MIMO BC. 
In the particular case with only two users, we have provided a precoding scheme achieving the 
number of DoFs corresponding to the most accurate CSI across the TXs. With arbitrary number 
of users, the number of DoFs achieved by conventional ZF has been derived and precoding 
schemes to improve over this number of DoFs value have been provided. Particularly, it has 
been shown how using codebooks with a hierarchical structure to quantize the CSI could lead 
to a significant number of DoFs improvement. Moreover, considering the opposite problem of 
optimizing the sharing of the CSI feedback under a total feedback constraint, we have derived 
a number of DoFs maximizing CSI configuration when ZF is used. Finally, simulations have 
confirmed that the novel precoding schemes outperform known linear precoding schemes at 
intermediate SNRs. 

This paper represents the first step on our work on the DCSI-MIMO channel and many 
problems remain open. Firstly, the DCSI-MIMO channel has been studied asymptotically for 
analytical tractability and the extension to finite SNR represents a challenging problem. The 
design of other robust precoders forms also an interesting problem with a strong potential. Finally, 
there are many other scenarios where distributed TXs want to cooperate but cannot practically 
share the exact same CSI (Relay channels, interference channels,...). In such settings, similar 
analysis could be developed to make the transmission more robust to the CSI discrepancies 
which are likely to exist in practical settings. 
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X. Appendix 

A. Some Results on Vector Quantization 

We consider the quantization of the unit-norm complex vector h E over a codebook C 
where both the channel to quantize and the elements of the codebook are multiplied by a unit- 
norm complex number (i.e., are rotated in the complex space) so as to let the first element of 
the vector be real valued. The quantized vector h is then obtained as 

h, = argmin ||c — (52) 

The rotation is done so as to optimize the performance of the quantization as it clearly leads to 
better performance. Since the norm is conserved when considering the canonical isomorphism 
from C^*" to M^^^, we can consider for the quantization the vectors as elements of M?^ made of 
the stacked real and imaginary parts of the original vector. 

With the first coefficient real valued, it is only necessary to consider M^^^^. Thus, a vector 
u = [ui,U2, ■ ■ ■ ,ukY ^ with its first coefficient real valued is represented in M^^'"^ as 
■UR2K-1 and is defined as 



A 

J2X-1 = 



Re(Mi) Re{u2) ■ ■ ■ Re{uK) Im('U2) Im('U3) . . . Im(Mx) 



(53) 



We can then define the angle between Uj^2k-i and v^2k-i in ^ as 

A[U^2K-l^V^2K-l) = arCCOS I rrr. . (54) 



Using the conservation of the norm by the canonical isomorphism, the quantization in ( |52| ) is 
rewritten as 

h^2K-i = argmin ||c]r2A'-i — ^]fj2A'-i ||^ (55) 
= argmin {2 - 2c^2K-ih^2K-i). (56) 

Cj,2if-l6Wg2A-l 

We can see from ( |56l ) that the quantization scheme aims at maximizing c^2K~ih^2K^i. This 
figure of merit can be linked to the commonly used chordal distance d(«) which is defined for 



two vectors as |43| 



d(c]R2K-i, h,]K2K-i) = a/ sin {A{c^2K-i,h^2K-i)) (57) 



1 - |cLA--i^R2if-ip. (58) 
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Thus, minimizing the chordal distance is equivalent to maximizing |c^2jf-i^iR2K-i p. This is then 



equivalent to the quantization scheme ([54]) if the half-space where h^2K-i belongs is known. This 
requires solely one additional bit. Since we are interested in the scaling of the number of bits, 
this will not make any difference. Consequently, we will study in the following the quantization 
scheme based on the minimization of the chordal distance 



c^2K-l6Cg2K-l 



h,R2x-i = argmin ysin (Z(cR2if-i, h.R2K-i)). (59) 



On that account, we now study the quantization scheme given by p9} over the Grassmannian 
manifold of dimensions (1, 2K — 1) in the field M (i.e., on the unitary ball in M^^^^). This 



quantization scheme is studied (in a much more general form) in [43 1 and we start by recalling 



some results. We then derive some new properties which will be needed in the derivations. 



Proposition 9 ( |43|, Corollary 2). The cumulative distribution function (CDF) ofd {h,c 



sin^(Z(^, c)) where c G ^ is an element of a random codebook is bounded as 

C2K-ix^'^ < F(x) = Pr{sin^(Z(h, c)) < x} < C2k-ix'^~\1 - x)^. (60) 
where C2k-i = T{K - l/2)/{T{K)T{l/2)). 



Proposition 10 ( [43|, Theorem 2). When the size L = 2" of the random codebook is sufficiently 



large (c2x-f ^/(^ < 1 is necessary), then it holds that 

2K — 1 _i D//t' 1^ r . . .,r ... ,r 



2K + 



ic-/Jf-)2-/(--) < E,,,[nnnsin^(Z(^,c))] < ^c-/Jf-)2-/(--). (61) 



Proposition 11. When the size L = 2^ of the random codebook is sufficiently large, the 
expectation of the logarithm of the quantization error is bounded as 



B + \0g2{c2K-l) ^ ^ 

(K-1) ~ '^''^ 



log2 ^minsin^(Z(h,, c)) 



< 



B + \og2ic2K~i) +log2(e) 



■ (62) 



Proof: Upper Bound: The derivation of an upper bound follows the same idea as the proof 
in Appendix B of p3| which derives an upper bound for the same expectation as in this proof. 



We will do the abuse of notation consisting in removing the index •^2k-i in the derivations but it will be clear that any 
mention of an angle will refer to the angle defined in R^^^~^. 
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only without the logarithm. We start by recalling a Lemma from [ [43| which follows easily from 
the definition but is helpful. 

Lemma 1 ( (43], Lemma 3). The empirical distribution function minimizing the distorsion over 
a given L = 2^ is 



F*.(x) 



ifx<Q 

LF{x) ifO<x<x* (63) 

1 if X > X* 
where x* satisfies LF{x*) = 1 and F{x) = Pr{sin^(Z(h,, c))| < x}. 

Note that Lemma [T] corresponds to the optimal codebook minimizing the average distance and 
is thusly a lower bound for the distorsion. We can then write 



'C,h 



— log I minsin (Z(h., c)) 



Pr{— log ( minsin^(Z(h., c)) I > z}dz 



Pr{minsin^(Z(h,, c)) < e ^}dz 







cec 



(64) 
(65) 



-log(a;*) p—oo 

< I dz+ LPr{sin2(Z(h,c)) < e-"}dz (66) 

'o J~log{x*) 



where (64) is obtained by exploiting the fact that the term in the expectation is a positive random 



variable and ( [66] ) follows from the previous lemma since the optimal codebook has a CDF taking 
larger value that the CDF for a random codebook for every value of the argument x. 

Following the same approach as the proof in Appendix B of |43| , we define Fo(x) = 
C2K-iX^~^ and Xq so that LFo(xo) = 1. Let also define F^y^{x) = C2k-iX^~^{^ — xY^I"^ 
and Xub so that LFub(a;ub) = 1- Finally, we define Fubub(a;) = c^K-i^^'^i)' — ^^)^^^'^ and Xubub 
so that -LFubub(a;ubub) = 1- 

It holds by construction that Xub < a^* < a^o since we know from Proposition [9] that Fo(2;) < 
F(x) < Fub(x). Clearly it follows that (1 - s)"^/^ < (1 - x^Y^I'^ for x G [0, sq] so that 
Fub(a^) < Fubub(a;) for x E [0,xo], which finally implies Xubub < a^ub- We can then use these 
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relations to derive an upper bound for ([66]). 



E, 



'C,h 



log ( minsin^(Z(h,, c)) 



< 



< 



< 



■log(x*) 

dz + 

■ logCXubub) 



log(z*) 



LF{e-')dz 



dz + 



LF{e-')dz 



log(Xubub) 



dz + 



■ log(a;o) 
oo 

- log(a;o) 



LF 



ubub 



[e-')dz. 



(67) 
(68) 
(69) 



Equation ([68]) follows from Xubub < x* < xq and ( [69] ) follows from the fact that Fub(a;) < 
Fubub(a;) for x E [0, Xq]. We now replace Fubub(»)» a^ubub. and Xq by their expressions to evaluate 
the integral. 



E, 



C,h 



— log I minsin (Z(h., c)) 



< 



1 



K-1 
1 



K-1 
1 



log 
log 



Lo 



2K~l 



+ 



LC2K-1 



- log(xo) 



K-1 



LC2K-1 
(log {Lc2K-l) + I) + oil] 



+ 



'l-{Lc2K-i)^^y'\K-l) 



(70) 

(71) 
(72) 



as L increases. Dividing by log(2) yields the final upper bound. 

Lower Bound: We start from the lower bound for the CDF given in Proposition [9] It has a 
form very similar to the CDF for the quantization of a complex vector in the unit-ball in 
which is usually used for multiple-antenna BC. Hence, we adapt the approach of the proof of 
Lemma 3 by Jindal in yj to the current setting. 

From the lower bound in Proposition [9] we write 

,2/ 



Pr{mm ( sin^(Z(^, c)) ) < z} > 1 - (1 - Csir-ix^^^^^^^ 



(73) 
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A lower bound for the expectation of the logarithm can then be calculated as follows. 



E 



'C,h 



■log I minsin^(Z(h,, c)) 



Pr|minsin^(Z(h,,c)) < e ''jd; 



POO 

Jo 



fe=0 
L 



K 



k=l ^ ^ 



K -1 



(74) 
(75) 

(76) 

(77) 
(78) 



where we have defined f{p) = Ylk=i it) p G N. To compute the value of f{L), 

we will use the following relation given in [ j44[ Sec. 0.155]. 

'n\ a'^+i (a + 1)"+^ - 1 



E 

fc=0 



kj k + 1 



n + 1 



(79) 



We now rewrite f{L) in order to be able to apply (79^ 



L 

L-l r 



■1) 



E 

k=l 
L-l 



L+l ^2K-1 

L 



L - 1 
k - 1 



E 

k=l 



L-l 
k-1 



\L+1 ^2K-1 



fe'=0 ^ ^ 



L-l 

E 

fe=i 



L-l 
k 

'L- 1 



(-1) 



k 



k+1 ^2K-l 
k 



{-C2K-1 + 1)^-1 



fiL-l) 



E 



1 - {-C2K-1 + K 
P 



(80) 
(81) 
(82) 

(83) 

(84) 
(85) 
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Furthermore we have the two following relations: 

L 



log(L) < V - < log(L) + 1 



OO r 



log(l — x) = — ^ — , for X G [—1, 1]. 



(87) 
(88) 



L=l 



Using these properties and dividing by log(2), we can obtain the final lower bound as 

L , ^ L 

/ „ ~ \ I , 



-'C,h 



— log! minsin (Z(h., c)) 

cgC 



1 >^ 1 1 >^ 

- {K-1) log(2) ^ p " {K-1) log(2) ^ 
log2(L) 1 



{l-C2K~ir 



p 



> 



{K - 1) {K- 1) log(2) ^ 



1 - C2i^-l)^ 
P 



(89) 
(90) 

(91) 



where we have used that the constant C2k-i is smaller than one to apply ([88]) and obtain the 
term log2(c2x-i). ■ 

B. Proof of Theorem [?] 

The proof generalizes to the distributed CSI configuration the proof of Theorem 4 in Appendix 
IV of [[T|, which derives the number of DoFs for the multiple- antenna BC with finite rate 
feedback. The generalization is non-trivial due to the fact that in the DCSTMIMO channel it is 
not only the inner product between the beamformer and the channel h^t^P which matters, but also 
the coherency between the coefficients used at the different TXs. Following this difference, we do 
not use the conventional Grassmannian quantization scheme but we use instead the quantization 



scheme described in Subsection II-B In a word, it consists in exploiting the fact that the norm 
is conserved by the canonical isomorphism between and M^^, to use the Grassmannian 
quantization in the real subspace M^^^^^ The reduction to 2K — 1 real dimensions comes from 
the multiplication by a unit-norm complex number to let the first coefficient be real valued. We 



then define the angles between vectors in that real linear space. We refer to Appendix X-A for 
more detail. 

The estimation error made at TX j about the channel vector hi is denoted by 5^ such 



that 5^ = hi — h)^' and the estimation error vectors made at TX j are stacked in the estimation 
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error matrix A^^^ defined as 



(92) 



We also denote by wp-* = 



^(i)| 



tyW the (Conventional ZF) unit-norm beamformer computed at 
TX j and by u* = t*/\\t*\\ the same beamformer based on perfect CSI. We omit in this proof 
the superscript •^'^^ for clarity. 

Furthermore, we consider in the following that the accuracy of the channel estimates increases 

(i) 

with the SNR, i.e., the CSI scaling coefficients al are all positive. If there is one pair of 
indices for which ap"* = 0, then the Euclidean distance between w^-'"' and ul does not 

decrease with P for all k such that the number of DoFs at all RXs vanishes. When this is not 
the case, the norm of the channel estimation errors can be approximated as 

= 2 - 2{h\'^)^hi 



2-2\(h9YhA 



sin2(Z(h?\ h,)) + o(sin2(Z(h?\ h,))) 



(93) 
(94) 
(95) 

(96) 
(97) 



where (95) is verified when the channel estimate belongs to the same half-space as the true 



channel vector. This holds true in this work for the reason explained in Appendix |X- A Equal 



ity ( |96| ) follows from the definition of the angle between two vectors and pT) is obtained via a 
Taylor expansion on the first order in the estimation error. 

From ([92l), we conclude that the square norm of the estimation error H^p"*!!^ is asymptotically 



equal to the chordal distance between the channel estimate and the true channel sin^(Z(h,p'', hi)) 
when the SNR increases. The chordal distance corresponds to the distance minimized by the 
Grassmannian quantization so that this will allow us to apply the theoretical results provided in 



Appendix X-A As a preliminary step, we will now evaluate the impact of the estimation error 



into the computation of the beamformers, i.e., evaluate the norm of the vector — u* for all j. 
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Lemma 2. Let's assume that Vi, > 0, then it holds asymptotically as P increases 

2 



E 



E 



log2 ( .jnax^ {sm\Z{h\^\h^)) 



+ 0(1). 



(98) 



Proof: We consider w.l.o.g. the precoding at TX j. Since Vi, a^'' > 0, the estimation error is 
infinitely small as P increases and we can do a first order approximation of the channel inverse 
and write 



H 



-1 



-1 



-1 



(99) 



Derivation of the Upper Bound: After multiplying by ej to obtain the j-th beamformer, 



the Right Hand-Side (RHS) of ( [99] ) can then be upper bounded as follows 

||(H-i - (H(^^))-i)e,f < \\U-'A^^^U-'\\l + oi\\A^^^\l) 

o(||A(^)||i) 

<^^AL(H)(5^||5?f) + o(l|A(^-)||^) 



< ||H-i^||A(-'')||^ 



K 



(100) 
(101) 

(102) 



k=l 



with A^j„(H) denoting the smallest eigenvalue of the channel matrix H. We then take the 
expectation of the logarithm of this term according to both the channel estimation error and 
the channel distribution. The term log(A^j„(H)) is shown to be integrable and its expectation 



is given in [|45|. The result follows by upper-bounding each of the estimation errors by 



(i) 



the error which is asymptotically the largest, i.e., the one corresponding to the smallest a 
Derivation of the Lower Bound: we start by factorizing the estimation error matrix as follows 



^0') = ^0') diag([||5i- 



(i)i 



(i)i 



(103) 



with the columns of A*^^^ consequently normalized to be unit-norm. We then assume w.l.o.g. 
that the asymptotic largest estimation error corresponds to the channel hi (i.e., the smallest CSI 
scaling coefficient is ai ^). Furthermore, we consider for the sake of exposition that no other 
channel has the same CSI scaling coefficient. The proof holds similarly if this condition does 
not hold. We can then write 

E[log(||(H-i - (H(^-))-i)e,|n] = E[log(||5i^')f )] +2E[log(||H-i^)] 



+ E[log(||(H-iA(^-)diag([l,||(5^^')||/||(5[ 



-1, 



+ o(E[log(||A(^-)||^)]) 
(104) 
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where we have defined H ^ = H ^/||H ^||p. The absolute value | log(||H ^||p)| can be upper 
bounded as in (|102[) by | log(fCAmin(H))| whose expectation is shown to exist in p3|, thus its 



expectation also exists. Similarly, the absolute value of the last term of the RHS in ( |104[ ) can 
be upper-bounded by an integrable function such that it is also integrable and its expectation is 
then a 0(1), i.e., it remains bounded as the SNR P increases. This concludes the proof. ■ 

Proof of Theorem |4} We will now use Lemma^to prove the theorem. We consider for simplicity 
that the CSI scaling coefficients are all different. The proof easily extends to the configurations 
with some coefficients equal and this is done solely to simplify the exposition. We assume w.l.o.g. 
that the TX with the smallest CSI scaling coefficient is TX 1. 
DoF Lower Bound : We denote by Ui G C^^^ the beamforming vecto^such that 



\fje{i,...,K}, {n,}, = K^^)}, 



P/K ■ 



(105) 



We start from the number of DoFs expression in ([6]) that we rewrite as 



DoF 



cZF 



: 1 - lim E 

P— !>00 



■■— lim E 

P— 5>00 



'H,{>V,,i} 



\og2 [1 + ^T.k^i\\hir\hfuk 

l0g2(P) 

l0g2(P) 



(106) 



(107) 



To obtain a lower bound for the number of DoFs, we need to derive an upper bound for the 
leaked interference in ( |107| ). We define first the selection matrices E, = diag(ej) and write 



hfuk 



K 



K 

i=i 



(i) 
k 



(108) 
(109) 



The vector Ui corresponds to the normahzed version of ti. Yet, it is exactly unit-norm only when all the TXs share the 
same channel estimate. It is otherwise impossible for the TXs to jointly normalize the beamformer based on different channel 
estimates. This does not represent a problem in practice because the power constraint is exactly fulfilled in average over the 
channel estimation errors. 
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which we insert in (1 1071) to obtain 



DoF: 



cZF 



> - lim Eh,{w,.,} 



> - lim Eh,{w.,,} 



l0g2(P) 



log2(P) 



(110) 



(111) 



From ( |99[ ), the TX whose computed beamformer exhibits the largest mean square error \\u^ — 
at arbitrarily large SNR P is the TX to whom the lowest CSI scaling coefficient belongs, 
which is by assumption TX 1. We can then write 



DoFf^> lim 

P— !-00 



E 



H,{W,j} 



\U 



(1) 



u 



* Il2 



E 



> lim 

P— >-oo 



H,{W,j} 



l0g2(P) 

log2 (maxi=i,„.,x, (sm'^{Z{hf\hi)) 



log2(P) 



minj=i,,„,x ^ + log2(c2i^-i) + log2(e) 



> lim 

P— >-oo 



(ii'-l)l0g2(P) 



mm a 

i=l,...,K 



(1) 



(112) 

(113) 

(114) 
(115) 



where ( 112 ) /5 obtained by permuting the expectation and the limit, ( 1 1 3| ) follows from Lemma^ 



and we have used Proposition 11 to obtain inequality (114). The last equation (115) corresponds 



to the smallest CSI scaling coefficient and provides the lower bound. 
DoF Upper Bound: We now derive an upper bound for the number of DoF s, which means a 



lower bound for the interference. We proceed similarly to ( |108| ) but this time to obtain a lower 
bound for the interference remaining after precoding: 



hfu,\ = \hfak\\\J2^,iu^k^-ul) 
hfa,\\\E,{ui'^-ul)\\ 



> 



ELe, 



(i) 



|pH,(l)||| (1) _ * 
1^1 "fc lll^fc 



(116) 

(117) 
(118) 



u 



where we have defined ak - 
ul)/\\u^j!'^ —ul \\. The two vectors forming each of the two inner products in ( |1 18[ ) are isotropically 



E;1iE,«- 1.^)11 and bi'> ^ iu['> - 



.U) 



(1) A u.W 
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distributed so that the expectation of their logarithm is finite. Inserting (|118|) inside the number 



of DoFs formula in ( |107| ), we can write the lower bound 

limp^oo EH,{Wi,j } 



Dof: 



cZF 



l0g2(P) 



linip^oo E 



> 



log2 [\\ul^ 



log2(P) 



(119) 
(120) 

with inequality ( 121 1 obtained from Lemma^ The proof concludes in the same way as the proof 
of the upper bound after using Proposition [77] 



limp_,oo E 



> 



-Iog2 (maXi=i^,„^K, (sin^{Z{h\^\hi)) 



C. Numerical Values for the Total Feedback Scaling 7 



Number of transmitting TXs: 
n 


Saturation of the number of DoFs: 

n'^{n — 1) 


Activation of the {n + l)-th TX: : 

+ 1) 


1 





2 


2 


4 


12 


3 


18 


36 


4 


48 


80 


5 


100 


150 



D. CSI Scaling Matrix Used in the Simulations 

For Fig. [5| the CSI scaling matrix arbitrarily chosen is 

1111 1 1 
11111 1 1 
11111 1 1 

a = 1 1 1 1 1 1 1 

1 1 1 1 1 0.3 1 
11111 1 1 
11111 1 1 



(122) 
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Preceding Scheme 


Number of DoFs 


Conventional ZF 





Active-Passive ZF 


2.1 


Conventional ZF with HQ 


5.3 


Active-Passive ZF with HQ 


6.3 
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